Searchable Metaspaces

نویسندگان

  • Steve Berman
  • Stefan Evert
  • Ulrich Heid
چکیده

The purpose of this presentation is to start a discussion about methodological and operational requirements for developing tools for internet browsing and/or querying of metadescriptions of language resources, in particular multimodal corpora. Among the most important requirements are: delimiting the relationship both between meta-descriptions and the resources they apply to, and between browsing and querying over the internet; establishing a standard for representing meta-descriptions; administering the web-based availability of language resource and their accessibility via meta-descriptions; and establishing user support for query editing and data interchange. We attempt to stake out positions regarding these requirements, addressing both their advantages and disadvantages. We base our positions on the EAGLES/ISLE proposal for a meta-description standard for language resources ([1]). Our views are also in uenced by work on the development of query languages for linguistic resources, such as CQP, the MATE query language Q4M, and the TIGER query language for syntactic tree annotations. 1 Overview of Objectives With the increasing development and use of multi-modal language resources, there is a growing need for suitable tools to access and query these resources. To facilitate these tasks, it is common and essential for the resources to be associated with meta-descriptions of their content (including object data annotations). Two perspectives are possible and relevant in this context: The local or site perspective: an institution has a (number of) multi-modal resource(s), and somebody wants to identify parts of these resources that satisfy certain meta-descriptions. Possibly one even wants to retrieve, from such resource(s), certain subsets (say turns, sentences, whole dialogues), according to a combination of metadata and linguistic (or other modality-speci c) criteria annotated in the resource. The resources are accessed locally and the search is also carried out on site. The global or web perspective: somebody wants to know about the existence of resources of a certain kind (i.e. satisfying certain conditions in terms of meta-descriptions); if a web search engine accepts the required meta-descriptions, then the resources can be located by browsing, and possibly even accessed and queried over the web. Although these perspectives make di erent demands on implementation, we will see that, from the point of view of the language resource user, they complement rather than compete with each other; hence, resource owners should accommodate both perspectives. Following the EAGLES/ISLE http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/ http://www.cogsci.ed.ac.uk/ dmck/MateCode/ http://www.coli.uni-sb.de/cl/projects/tiger/

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Browse searchable encryption schemes: Classification, methods and recent developments

With the advent of cloud computing, data owners tend to submit their data to cloud servers and allow users to access data when needed. However, outsourcing sensitive data will lead to privacy issues. Encrypting data before outsourcing solves privacy issues, but in this case, we will lose the ability to search the data. Searchable encryption (SE) schemes have been proposed to achieve this featur...

متن کامل

Searchable Metaspaces 1 Overview of Objectives

The purpose of this presentation is to start a discussion about methodological and operational requirements for developing tools for internet browsing and/or querying of meta-descriptions of language resources, in particular multimodal corpora. Among the most important requirements are: delimiting the relationship both between meta-descriptions and the resources they apply to, and between brows...

متن کامل

Fuzzy retrieval of encrypted data by multi-purpose data-structures

The growing amount of information that has arisen from emerging technologies has caused organizations to face challenges in maintaining and managing their information. Expanding hardware, human resources, outsourcing data management, and maintenance an external organization in the form of cloud storage services, are two common approaches to overcome these challenges; The first approach costs of...

متن کامل

Implementing Persistent Objects in the Apertos Y Operating System

This paper presents a way of providing users with a persistent object running under the Apertos operating system. We present an implementation of persistent objects by using object migration between metaspaces in the re ective object architecture. An Apertos object is stored into stable storage by migrating to a storage metaspace that is an abstraction of object storage. We also present the cur...

متن کامل

SESOS: A Verifiable Searchable Outsourcing Scheme for Ordered Structured Data in Cloud Computing

While cloud computing is growing at a remarkable speed, privacy issues are far from being solved. One way to diminish privacy concerns is to store data on the cloud in encrypted form. However, encryption often hinders useful computation cloud services. A theoretical approach is to employ the so-called fully homomorphic encryption, yet the overhead is so high that it is not considered a viable s...

متن کامل

Searchable Translation Memories

In this paper we introduce a technique for creating searchable translation memories. Linear B’s searchable translation memories allow a translator to type in a phrase and retrieve a ranked list of possible translations for that phrase, which is ordered based on the likelihood of the translations. The searchable translation memories use translation models similar to those used in statistical mac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000